Methods for transferring reserves when moving virtual machines across systems

ABSTRACT

According to one embodiment, a method includes receiving device information from a source system; building a virtual device on a host system based at least in part on the received device information; at least one of: transferring a reserve of a device stored on a storage device to a target system; and receiving results of transferring the reserve to the target system; determining if the reserve is held by the target system; recording the reserve if the reserve is held by the target system; and sending a notification that the reserve is not held by the target system if the reserve is not held by the target system. Other methods are also described according to various embodiments.

RELATED APPLICATIONS

This application is a continuation of copending U.S. patent application Ser. No. 12/943,874, filed Nov. 10, 2010, which is herein incorporated by reference.

BACKGROUND

The present invention relates to data storage systems, and more particularly, this invention relates to methods and systems for maintaining data integrity when moving virtual machines across systems.

In many storage systems, there are instances in which a device in a storage unit is held such that a specific host system has exclusive access to the device. This is typically accomplished by issuing a command from a host system to a control unit which controls the device. The exclusive access to the device is maintained by the host system until the host releases this exclusive access. In general, this command may be held in order to maintain consistency of data on this device when changes are occurring, e.g., another server is not able to access the data on this held device while the host system makes changes to the data on the device. In addition to holding this device, there are conditions in which the host system may transfer this exclusive access to this device to another (host) system. In typical storage systems, this can be done by the target host system querying the current state of the device that is being held to verify that the source host is actually holding the access to this device. After this point, the target host system or source host system issues an unconditional command to take or reassign the access to this device unconditionally. By unconditionally, what is meant is that this command takes control of or reassigns the access to this device regardless of any other ownership or control by another entity, e.g., this unconditional command takes the access to this device no matter which entity is holding the device.

BRIEF SUMMARY

In one embodiment, a method includes receiving device information from a source system, building a virtual device on a host system based at least in part on the received device information, transferring a reserve of a device stored on a storage device to a target system and/or receiving results of transferring the reserve to the target system, determining if the reserve is held by the target system, recording the reserve if the reserve is held by the target system, and sending a notification that the reserve is not held by the target system if the reserve is not held by the target system.

Other aspects and embodiments of the present invention will become apparent from the following detailed description, which, when taken in conjunction with the drawings, illustrates by way of example the principles of the invention.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

FIG. 1 illustrates a system architecture, in accordance with one embodiment.

FIG. 2 shows a representative hardware environment that may be associated with the systems of FIG. 1, in accordance with one embodiment.

FIG. 3 shows a flowchart of a reserve transfer, according to one embodiment.

FIG. 4 shows a flowchart of a method for transferring or copying a virtual device, according to one embodiment.

FIG. 5 shows a flowchart of a reserve push command, according to one embodiment.

FIG. 6 shows a flowchart of a method for transferring or copying a virtual device, according to one embodiment.

FIG. 7 shows a flowchart of a reserve pull command, according to one embodiment.

FIG. 8 shows a flowchart of a reserve command, according to one embodiment.

FIG. 9 shows a flowchart for a release command, according to one embodiment.

FIG. 10 shows a flowchart for an unconditional reserve command, according to one embodiment.

DETAILED DESCRIPTION

The following description is made for the purpose of illustrating the general principles of the present invention and is not meant to limit the inventive concepts claimed herein. Further, particular features described herein can be used in combination with other described features in each of the various possible combinations and permutations.

Unless otherwise specifically defined herein, all terms are to be given their broadest possible interpretation including meanings implied from the specification as well as meanings understood by those skilled in the art and/or as defined in dictionaries, treatises, etc.

It must also be noted that, as used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless otherwise specified.

The following description describes methods and systems for maintaining data integrity when moving virtual devices across systems. According to one embodiment, a manage reserve command may be implemented in existing storage devices, storage systems, logic, etc. The manage reserve command may be a push and/or pull reserve command, as described in detail herein. This command may be used in various embodiments. A source system that holds a reserve on a storage device may be able to issue this command in order to transfer the device reserve to another host (target) system or the target host system may issue this command to take the reserve if the specified source system holds the reserve on the device. This enables the system to be able to transfer this reserve atomically without the possibility of losing exclusive access to the device, so that data integrity is maintained throughout the transfer of the reserve, according to preferred embodiments.

In one general embodiment, a host system includes logic adapted for receiving device information from a source system, logic adapted for building a virtual device based at least in part on the received device information, logic adapted for transferring a reserve of a storage device to the host system and/or receiving results of transferring the reserve to the host system, logic adapted for determining if the reserve is held by the host system, logic adapted for recording the reserve if the reserve is held by the host system, and logic adapted for sending a notification that the reserve is not held by the host system if the reserve is not held by the host system.

In another general embodiment, a source system includes logic adapted for creating a reserve of a storage device, logic adapted for sending device information to a target system, logic adapted for receiving an indication that the virtual device build is complete from the target system, logic adapted for transferring the reserve to the target system, and logic adapted for sending results of transferring the reserve to the target system.

In another general embodiment, a method includes receiving device information from a source system, building a virtual device on a host system based at least in part on the received device information, transferring a reserve of a device stored on a storage device to a target system and/or receiving results of transferring the reserve to the target system, determining if the reserve is held by the target system, recording the reserve if the reserve is held by the target system, and sending a notification that the reserve is not held by the target system if the reserve is not held by the target system.

In yet another general embodiment, a computer program product for transferring a virtual device includes a computer readable storage medium having computer readable program code embodied therewith, the computer readable program code includes computer readable program code configured to receive device information from a source system, computer readable program code configured to build a virtual device on a target system based at least in part on the received device information, computer readable program code configured to transfer a reserve of a storage device and/or receive results of transferring the reserve to the target system, computer readable program code configured to determine if the reserve is held by the target system and to record the reserve if the reserve is held by the target system, and computer readable program code configured to send a notification that the reserve is not held by the target system if the reserve is not held by the target system.

According to one embodiment, a manage reserve command may be implemented in existing storage devices, storage systems, logic, etc. The manage reserve command may be a push and/or pull reserve command, as described in detail herein. This command may be used in various embodiments. A source system that holds a reserve on a storage device may be able to issue this command in order to transfer the device reserve to another host (target) system or the target host system may issue this command to take the reserve if the specified source system holds the reserve on the device. This enables the system to be able to transfer this reserve atomically without the possibility of losing exclusive access to the device, so that data integrity is maintained throughout the transfer of the reserve, according to preferred embodiments.

As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as “logic,” a “circuit,” “module,” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.

Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.

A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.

Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).

Aspects of the present invention are described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.

The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.

FIG. 1 illustrates a system architecture 100, in accordance with one embodiment. As shown in FIG. 1, one or more networks 102 are provided for communicating with a plurality of systems, devices, etc. A gateway may be coupled between the networks 102 and any systems, devices, etc. In the context of the present network architecture 100, the network 102 may take any form including, but not limited to a local area network (LAN), a wide area network (WAN) such as the Internet, a public switched telephone network (PSTN), an internal telephone network, a storage area network (SAN), etc. The gateway may function as a router, which is capable of directing a given packet of data that arrives at the gateway, and a switch, which furnishes the actual path in and out of the gateway for a given packet.

Further included is at least a source system 104 and a target system 108. In some embodiments, a third system 106 may be included, along with any number of additional systems, as would be apparent to one of skill in the relevant art. Each system 104, 106, 108 may be connected to the network 102 and/or to each other. For purposes of the present description, transfers are described between the source system 104 and the target system 108, with the third system 106 being capable of disrupting operations between the source system 104 and target system 108, according to various embodiments. It should be noted that each system 104, 106, 108 may be a data server, which may include any type of computing device/groupware, a host system, or any other computing system.

A control unit 110 may be coupled to each of the one or more storage devices 112 a, 112 b, 112 c, . . . , 112 n, but is not necessary. Each storage device 112 a, 112 b, 112 c, . . . , 112 n, may have a separate control unit, or may include control unit functionality, thereby rendering the control unit 110 unnecessary. In any case, a control unit 110, multiple control units, or no control may be used to communicate and/or control the storage devices 112 a, 112 b, 112 c, . . . , 112 n, based on commands issued by one or more of the systems 104, 106, 108, according to various embodiments.

Any number of storage devices may be used, and any type of storage device may be used, such as a direct access storage device (DASD), tape drive, hard disk drive, optical disk drive, non-volatile storage, etc., arranged in any configuration as would be apparent to one of skill in the art. It should be noted that the one or more storage devices 112 a, 112 b, 112 c, . . . , 112 n may be shared by two or more of the systems 104, 106, 108, according to one embodiment.

Each system 104, 106, 108, according to one embodiment, may be assigned a unique Path Group Identifier (PGID). This PGID may be used to identify the system under certain circumstances and/or situations. For example, of particular interest in the present disclosure, the PGID of a system, such as the first system 104, may be used to identify that the first system 104 holds a reserve on a particular device, such as storage device 112 a. In this way, the PGID may be used as an identifier of what the system has rights to, has reserved, has access to, is currently accessing, etc. Any type of identifier may be used as the PGID as would be apparent to one of skill in the art; for example, in an IBM z/Series architecture, the PGID is a well known parameter, but is not so limited if the embodiments and/or methods described herein are used in a system that does not use an IBM z/Series architecture.

A peripheral or series of peripherals, e.g., facsimile machines, printers, networked and/or local storage units or systems, etc., may be coupled to the network 102, according to various embodiments. It should be noted that databases and/or additional components may be utilized with, or integrated into, any type of network element coupled to the network 102. In the context of the present description, a network element may refer to any component of a network.

According to some approaches, methods and systems described herein may be implemented with and/or on virtual systems and/or systems which emulate one or more other systems, such as a UNIX system which emulates an IBM z/OS environment, a UNIX system which virtually hosts a MICROSOFT WINDOWS environment, a MICROSOFT WINDOWS system which emulates an IBM z/OS environment, etc. This virtualization and/or emulation may be enhanced through the use of VMWARE software, in some embodiments.

In more approaches, one or more networks 102 may represent a cluster of systems commonly referred to as a “cloud.” In cloud computing, shared resources, such as processing power, peripherals, software, data, servers, etc., are provided to any system in the cloud in an on-demand relationship, thereby allowing access and distribution of services across many computing systems. Cloud computing typically involves an Internet connection between the systems operating in the cloud, but other techniques of connecting the systems may also be used.

FIG. 2 shows a representative hardware environment associated with a system 104, 106, 108 or any user device of FIG. 1, in accordance with one embodiment. This figure illustrates a typical hardware configuration of a system 200 having a central processing unit 210, such as a microprocessor, and a number of other units interconnected via a system bus 212. Of course, a system may include more or less units than those shown in FIG. 2, according to various embodiments, and the units shown in FIG. 2 are illustrative only, and not meant to be limiting on any embodiments and/or approaches described herein.

The system 200 shown in FIG. 2 includes a Random Access Memory (RAM) 214, Read Only Memory (ROM) 216, an I/O adapter 218 for connecting peripheral devices such as disk storage units 220 to the bus 212, a user interface adapter 222 for connecting a keyboard 224, a mouse 226, a speaker 228, a microphone 232, and/or other user interface devices such as a touch screen and a digital camera (not shown) to the bus 212, communication adapter 234 for connecting the workstation to a communication network 235 (e.g., a data processing network) and a display adapter 236 for connecting the bus 212 to a display device 238.

The system 200 may have resident thereon an operating system such as the Microsoft Windows® Operating System (OS), a MAC OS, a UNIX OS, etc. It will be appreciated that a preferred embodiment may also be implemented on platforms and operating systems other than those mentioned. A preferred embodiment may be written using JAVA, XML, C, and/or C++ language, or other programming languages, along with an object oriented programming methodology. Object oriented programming (OOP), which has become increasingly used to develop complex applications, may be used.

It will further be appreciated that embodiments presented herein may be provided in the form of a service deployed on behalf of a customer to offer service on demand.

In order to avoid the problems associated with typical device reserving, a new command may, in some approaches, be introduced into the software running on the host systems and on the data storage system having the device. This new command, according to one embodiment, has transfer parameter data that includes a flag (such as a single byte or string for indicating a status, referred to herein as a flag byte for simplicity) and a unique PGID for the host system being used but that did not issue the command. The flag byte, according to one embodiment, may specify a type of reserve transfer that is anticipated to occur on the device (e.g., Push or Pull). The PGID associated with the path on which the command was issued may be used as the identifier for the host system issuing the command.

When a Push Reserve bit is set in the flag byte field, then the PGID in the parameter data may identify the host system that the reserve will be given to, and the PGID for the path on which the command was issued may specify the host system in which the reserve is being held.

When the Pull Reserve bit is set in the flag byte field, then the PGID in the parameter data may identify the host system that holds the reservation, and the PGID for the path on which the command was issued may specify the host system to which the reserve will be granted.

In addition, results of the Push and Pull commands are returned to the system issuing the respective command, thereby allowing the system to have knowledge of the outcome, e.g., whether the command completed or not.

In order to handle time periods in which the reserve may be taken by another host system, the command may be failed by a control unit if the device is not reserved to the specified host system that software anticipates to be holding the reservation on the device. In one approach, this command may be an atomic command, e.g., any read/write commands and/or Unconditional Reserve operations that are issued to the device before the atomic command has completed and the new reservation has taken effect may cause the atomic command to fail. In preferred embodiments, the initiating system will be notified of the success or failure of the atomic command so that it can take appropriate action(s) to maintain data integrity.

An atomic command, by its nature, guarantees isolation from concurrent processes. In some approaches, an atomic command may include a succeed-or-fail definition, e.g., the command either successfully changes the state of the system (succeeds), or the command has no tangible effect on the system (fails).

If this command completes successfully, there is no doubt that the device will be reserved to the new host system, according to preferred embodiments. If the reservation is taken by another host system after the command, data integrity may still be maintained through other methods. If the command fails, the target system may take appropriate action(s) to maintain data integrity.

Now referring to FIG. 3, a method 300 is shown for transferring or copying a virtual device from a source system to a target system, according to one embodiment. In the descriptions below, the source system and the target system may be host systems. Also, in the context of FIG. 3, FIGS. 8-10 show operations according to several embodiments, which are described in turn.

Referring again to FIG. 3, in operation 302, the source system issues a reserve command for a particular tape drive, direct access storage device (DASD), or any other storage device which is shared by multiple host systems. This reserve command will effectively hold the device so that it can be accessed by the reserve holder, in this case the source system. In one example, the reserve command may function as shown in FIG. 8. In addition, the system issues a release command when the reserve is no longer needed, such as the release command described in FIG. 9. Also, the source system sends device information to the target system, as shown in FIG. 3 in operation 304. In operation 306, the target system builds a virtual device that is equivalent to a virtual device on the source system using information received from the source system, and then sends an indication to the source system that the build is complete once it is completed, which takes a period of time which may include some delay, in operation 308. In operation 310, the source system retrieves the reserve state for the particular device, e.g., it retrieves the reserve state using a simple query. As shown in operation 312, at any time, a third system may issue an unconditional reserve command for the particular device that was reserved by the source system. This third system can then make changes to the data on the particular device. If operation 312 is executed after operation 310 and before operation 320, then a data integrity problem arises. Then, in operation 314, when the reserve state is sent by the source system, and in operation 316, when the reserve state is received by the target system, the data on the particular device may have been changed.

In operation 318, the reserve state of the device on the source system at the tune operation 310 was executed is checked. In operation 320, if it was held (e.g., the third system did not get the reserve state until after operation 310), an unconditional reserve command is issued, such as the unconditional reserve command described in FIG. 10, by the target system to reserve the particular device so that data can be accessed on the device. Referring again to FIG. 3, if the reserve is held by the third system at the time operation 310 was executed, then in operation 322, a notification is sent indicating that the reserve has been lost (e.g., it is not held by the target system). This allows for the previous reserve holder to take appropriate action(s). Because of the possible actions by the third system during the time between operations 310 and 320, which may interrupt control of the reserve, method 300 does not have data integrity.

Now referring to FIG. 8, a method 800 is shown which represents a simplified unit flowchart for a reserve command, according to one embodiment. Of course, the method 800 may include more or less operations than those described below and shown in FIG. 8, as would be known to one of skill in the art.

FIG. 8 presents an example of a reserve command for the IBM z/series DASD reserve command that works at the device level. It is a synchronous command that does not return until the reserve state is granted to the invoker of the command. It is one method of reserving a device. Other methods may be different, including being asynchronous, according to various approaches.

In operation 802, an invoker PGID is retrieved. The invoker may be a host system, a server, etc., and the PGID may be returned using any method known in the art.

In operation 804, a first status check is performed to determine if a storage device is reserved. The storage device may be a DASD, a tape drive, or any other storage device which is capable of being shared by multiple host systems.

In operation 806, if the first status check indicates that the storage device is reserved, a second status check is performed to determine if the invoker PGID is the same as a reserve holder PGID. If so, then the reserve has been completed properly.

If the first status check indicates that the storage device is not reserved, the method 800 proceeds to operation 810.

If the second status check indicates that the invoker PGID is not the same as a reserve holder PGID, the method 800 proceeds to operation 808.

In operation 808, the method 800 is delayed until the reserve is available. Periodic status checks may be performed to determine when the reserve becomes available, such as every 1 ms, 0.5 ins, 1 second, etc., based on some other action completing, an indication being issued, etc., or any other way as known in the art.

In operation 810, the reserve holder PGID is set to the invoker PGID, thereby completing the reserve command.

Now referring to FIG. 9, a method 900 is shown which represents a simplified unit flowchart for a release command, according to one embodiment. Of course, the method 900 may include more or less operations than those described below and shown in FIG. 9, as would be known to one of skill in the art.

In operation 902, a reserve is set to not held by any system, e.g., if the reserve is currently held by any system, it is released from being held.

In operation 904, a status check is performed to determine if there are any pending reserves, such as reserve requests issued by a host system. If no reserves are pending, then the release command is completed.

In operation 906, if reserves are pending, then the PGID of the system which issued the next pending reserve is retrieved.

In operation 908, the reserve holder PGID is set to the PGID of the system which issued the next pending reserve, and the release command is completed.

Of course, the release command may be repeated until all pending reserves are cleared, according to some approaches.

Now referring to FIG. 10, a method 1000 is shown which represents a simplified control unit flowchart for an unconditional reserve command, according to one embodiment. Of course, the method 1000 may include more or fewer operations than those described below and shown in FIG. 10, as would be known to one of skill in the art.

Method 1000 may be used to avoid situations where an unconditional reserve takes control of a reserve without the previous reserve holder having any notification of the unconditional reserve being processed.

In operation 1002, when an unconditional reserve is issued by any system, a status check is performed to determine if a system currently holds the reserve for which the unconditional reserve was issued. If the reserve is not currently being held, the method 1000 proceeds to operation 1008.

If the reserve is currently being held, then in operation 1004, a PGID for the reserve holder is retrieved.

In operation 1006, the reserve holder is notified that the reserve has been usurped by another system. In one approach, the PGID of the system which issued the unconditional reserve may be queried by the reserve holder system.

In operation 1008, the reserve holder PGID is set to the invoker system's PGID (e.g., the system which issued the unconditional reserve).

Now referring to FIG. 4, a method 400 is shown for transferring or copying a virtual device from a source system to a target system, according to one embodiment. In this embodiment, a tape drive, a storage library, a DASD, or any other storage device(s) which is shared by multiple hosts may be a device on which a virtual device is located. In the descriptions below, the source system, the target system, and/or the third system may be host systems.

The method 400 may be carried out in any desired environment, including but not limited to, those shown in FIGS. 1-2, according to various embodiments. Of course, the method 400 may include more or less operations than those described below and shown in FIG. 4, as would be known to one of skill in the art.

In operation 402, the source system issues a reserve command for a particular tape drive, direct access storage device (DASD), or any other storage device which is capable of being shared by multiple host systems. This reserve command will effectively hold the device so that it can be accessed by the reserve holder, in this case the source system.

In operation 404, the source system sends device information to the target system. The device information which is sent to the target system provides details of the virtual device which is to be copied or transferred to the target system.

In operation 406, the target system builds a virtual device that is equivalent to a virtual device on the source system using information received from the source system. Any method to construct a virtual device may be used, as known to one of skill in the art.

In operation 408, the target system sends an indication to the source system that the build is complete once it is completed.

In operation 410, the source system issues a transfer reserve command for the particular device so that it can transfer it to the target system, which is a push command. For details of a push command, refer to FIG. 5 which shows a push command, according to one embodiment. In this case, instead using a two-step process of getting the reserve state and then doing an unconditional reserve (as performed in method 300 of FIG. 3), a new reserve push command is issued which forces a response on whether the reserve push worked or not. Referring again to FIG. 4, at this point in method 400, it is known whether the reserve push command worked or not, since the reserve push command is an atomic command.

In operation 412, the transfer results are sent to the target system.

At any point during this process, an unconditional reserve command may be issued by a third system, as indicated by operation 414. However, this unconditional reserve command will not affect the method 400 because a control unit provides integrity at that level, even if the unconditional reserve command is issued after the reserve push worked. There are no data integrity problems because the control unit will provide notification of the unconditional reserve.

In operation 416, the transfer results are received by the target system. After knowing whether the reserve push command worked or not, the target system may perform appropriate actions based on the results.

In operation 418, a status check is performed on the reserve of whether it is held by the target system or not. In method 400, this check returns a positive result indicating the reserve is held unless an Unconditional Reserve command issued by the third system completes prior to the reserve push command completing.

In operation 420, if it is held (e.g., the third system did not get the reserve state), the reserve is recorded by the target system so that the data can be accessed on the particular device.

In operation 422, if the reserve is not held by the target system (e.g., the third system holds the reserve, the third system held the reserve and then released it, another system took the reserve from a system holding the reserve, etc.; all occurring prior to the reserve push), then a notification is sent indicating that the reserve is not held. This allows for the previous reserve holder to take appropriate action.

Now referring to FIG. 5, a method 500 is shown which represents a reserve push command, as shown in operation 410 of FIG. 4, according to one embodiment. This represents an atomic command as far as affected systems are concerned, e.g., a command which requires certain criteria to be met before the command is completed. As shown in FIG. 5, only one I/O is allowed to a device from any Path Group at any given time. Also, the control unit locks out any unconditional reserves that may be issued for the device, preferably by serializing the commands.

The method 500 may be carried out in any desired environment, including but not limited to, those shown in FIGS. 1-2, and 4, according to various embodiments. Of course, the method 500 may include more or less operations than those described below and shown in FIG. 5, as would be known to one of skill in the art.

In operation 502, a PGID is retrieved for each of an invoker (the system that is issuing the request, such as the source system), a reserve holder (whichever system holds the reserve), and a new reserve holder command input (a system which will access the reserve, such as the target system).

In operation 504, a status check is performed to determine if the reserve holder PGID is the invoker PGID, which indicates that the reserve holder is the source system. If it is not, then an unconditional reserve was received prior to this check, the reserve was released prior to the check, etc.

In operation 506, if the reserve holder PGID is the same as the invoker PGID, then the reserve holder PGID is set as the command input PGID. Then, in operation 508, an indication that the reserve transfer was successful is returned.

In operation 510, if the reserve holder PGID is not the same as the invoker PGID, an indication that the reserve transfer was not successful is returned.

Now referring to FIG. 6, a method 600 is shown for transferring or copying a virtual device from a source system to a target system, according to one embodiment. In this embodiment, a tape drive, a DASD, or any other storage device which is shared by multiple hosts may be the device whose data integrity is to be protected. In the descriptions below, the source system and the target system may be host systems.

The method 600 may be carried out in any desired environment, including but not limited to, those shown in FIGS. 1-2, according to various embodiments. Of course, the method 600 may include more or less operations than those described below and shown in FIG. 6, as would be known to one of skill in the art.

In operation 602, the source system issues a reserve command for a particular tape drive, DASD, or any other storage device which is shared by multiple host systems. This reserve command effectively hold the device so that it can be accessed by the reserve holder, in this case the source system.

In operation 604, the source system sends device information to the target system. The device information which is sent to the target system provides details of the virtual device which is to be copied or transferred to the target system.

In operation 606, the target system builds a virtual device that is equivalent to a virtual device on the source system using information received from the source system. Any method to construct a virtual device may be used, as known to one of skill in the art.

In operation 608, the target system issues a transfer reserve command for the particular device so that it can transfer it from the source system, which is a pull command. In this case, instead using a two step process of getting the reserve state and then doing an unconditional reserve (as performed in method 300 of FIG. 3), a new reserve pull command is issued which forces a response on whether the reserve pull worked or not. Therefore, referring again to FIG. 6, at this point in method 600, it is known whether the reserve pull command worked or not, since the reserve pull command is an atomic command. One example of a reserve pull command is shown in FIG. 7.

This pull command may be preferable to a push command as shown in FIG. 4, according to some embodiments, such as those working with shared devices, for example. The word ‘shared’ when describing devices in the description of FIG. 6 may have a different meaning from the word ‘shared’ in reference to FIG. 1. In this case, ‘shared’ may refer to sharing a single real device between two or more users (such as guests, hosts, virtual devices, etc., depending on the terminology deployed) on a system, with the system simulating the reserve function. However, this is only an example, because the pull command could also be used by real host systems sharing a real device, as in FIG. 1.

Referring again to FIG. 6, at any point during this process, an unconditional reserve command may be issued by a third system, as indicated by operation 610 of FIG. 6. However, this unconditional reserve command will not affect the method 600 because a control unit provides integrity at that level, even if the unconditional reserve command is issued after the reserve push worked, so that there are no data integrity problems.

In operation 612, a status check is performed on the reserve of whether it is held by the target system or not. In method 600, this check returns a positive result indicating the reserve is held unless an Unconditional Reserve command issued by the third system completes prior to the reserve pull command completing.

In operation 614, if it is held (e.g., the third system did not get the reserve state), the reserve is recorded, by the target system so that the data can be accessed on the particular device.

In operation 616, if the reserve is not held by the target system (e.g., the third system holds the reserve, the third system held the reserve and then released it, another system took the reserve from a system holding the reserve, etc.), then a notification is sent indicating that the reserve is not transferred. This allows for the previous reserve holder to take appropriate action.

Now referring to FIG. 7, a method 700 is shown which represents a reserve pull command, as shown in operation 608 of FIG. 6, according to one embodiment. This represents an atomic command, e.g., a command which requires certain criteria to be met before the command is completed. As shown in FIG. 7, only one I/O is allowed to a device from any Path Group at any given time. Also, the control unit locks out any unconditional reserves that may be issued for the device, preferably by serializing the commands.

The method 700 may be carried out in any desired environment, including but not limited to, those shown in FIGS. 1-2, and 6, according to various embodiments. Of course, the method 700 may include more or less operations than those described below and shown in FIG. 7, as would be known to one of skill in the art.

In operation 702, a PGID is retrieved for each of an invoker (the system that is the new reserve holder, such as the target system), a reserve holder (whichever system holds the reserve), and a current reserve holder command input (a system which holds the reserve, such as the source system).

In operation 704, a status check is performed to determine if the current reserve holder command input PGID is the reserve holder PGID, which indicates that the reserve holder is the source system. If it is not, then an unconditional reserve was received prior to this check, the reserve was released prior to the check, etc.

In operation 706, if the current reserve holder command input PGID is the same as the reserve holder PGID, then the reserve holder PGID is set as the invoker PGID. Then, in operation 708, an indication that the reserve transfer was successful is returned.

In operation 710, if the current reserve holder command input PGID is not the same as the reserve holder PGID, an indication that the reserve transfer was not successful is returned.

According to various embodiments, a system may have logic adapted to perform some or all operations from any of methods 300, 400, 500, 600, 700, 800, 900, and 1000, alone or in conjunction with operations of any other methods.

In more embodiments, a computer program product for transferring a virtual device may include a computer readable storage medium having computer readable program code embodied therewith. The computer readable program code may be configured to perform some or all operations from any of methods 300, 400, 500, 600, 700, 800, 900, and 1000, alone or in conjunction with operations of any other methods.

The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.

While various embodiments have been described above, it should be understood that they have been presented by way of example only, and not limitation. Thus, the breadth and scope of an embodiment of the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents. 

1. A method, comprising: receiving device information from a source system; building a virtual device on a host system based at least in part on the received device information; at least one of: transferring a reserve of a device stored on a storage device to a target system; and receiving results of transferring the reserve to the target system; determining if the reserve is held by the target system; recording the reserve if the reserve is held by the target system; and sending a notification that the reserve is not held by the target system if the reserve is not held by the target system.
 2. The method as recited in claim 1, wherein the reserve is transferred from the source system to the target system and the source system initiates transferring the reserve.
 3. The method as recited in claim 2, further comprising sending an indication to the source system that the virtual device build is complete prior to receiving results of transferring the reserve.
 4. The method as recited in claim 1, wherein transferring the reserve comprises: retrieving a path group identifier (PGID) for each of: an invoker system, a reserve holder system, and a new reserve holder command input; determining if the PGID of the reserve holder system is the same as the PGID of the invoker system; and if the PGID of the reserve holder system is the same as the PGID of the invoker system: setting the PGID of the reserve holder system to the PGID of the new reserve holder command input; and sending a notification that transferring the reserve was successful; otherwise, sending a notification that transferring the reserve was not successful.
 5. The method as recited in claim 4, wherein the invoker system is the source system.
 6. The method as recited in claim 1, wherein transferring the reserve comprises: retrieving a path group identifier (PGID) for each of: an invoker system, a reserve holder system, and a current reserve holder command input; determining if the PGID of the current reserve holder command input is the same as the PGID of the reserve holder system; and if the PGID of the current reserve holder command input is the same as the PGID of the reserve holder system: setting the PGID of the reserve holder system to the PGID of the invoker system; and sending a notification that transferring the reserve was successful; otherwise, sending a notification that transferring the reserve was not successful.
 7. The method as recited in claim 6, wherein the invoker system is the target system. 